We derive an exact formula for the covariance of Cartesian distances in two simple polymer models, 
the freely-jointed chain and a discrete flexible model with nearest-neighbor interaction. We show 
that even in the interaction-free case correlations exist as long as the two distances at least partially 
share the same segments. For the interacting case, we demonstrate that the naive expectation of 
increasing correlations with increasing interaction strength only holds in a finite range of values. 
Some suggestions for future single-molecule experiments are made. 

I. INTRODUCTION 

With the advent of new powerful tools for imaging and manipulation, the detection of single molecules and the 
characterization of their dynamics is a rapidly growing and developing field in experimental biophysics. Recently, 
the possibility to track molecular motion at the level of a single molecule has received substantial attention. Indeed, 
such methods can in principle provide a glimpse of a biological molecule "at work" (see [1] for a recent review). 
The complexity and variety of interactions in biomolecules make it hard to study biomolecular dynamics
quantitatively through a theoretical model that ought to reproduce most of the experimental observations. It is thus
in computer simulations of empirical models [2, 3] that the most extensive quantitative studies of biomolecular
dynamics can be found. The challenge of analyzing simulation results is in a sense opposite to that of
single-molecule experiments. On the one hand, in experiments one attempts to extract as much information on the
dynamics as possible from a small number of accessible observables. On the other hand, for the interpretation of
numerical simulations one needs to develop schemes that reduce the large amount of data resulting from the
numerous degrees of freedom to a meaningful subset.

Obviously, if not guided by prior motivation, the ways to achieve this reduction are manifold. A large number of 
such schemes have been proposed, involving linear and nonlinear concatenation of degrees of freedom (see e.g. [4] 
for a comparison of common approaches). Unfortunately, very few of these methods provide testable predictions on 
observables that are directly accessible in experiments. Hence, their potential for the interpretation of experiments 
probing biomolecular motion is limited. 

In this work, we explore an alternative direction for probing biomolecular motion by introducing an experimentally
testable and hence falsifiable framework for characterizing cooperative motion in the simulation of empirical
models for biomolecules. More precisely, we introduce 4-point correlation functions of distances as a new measure
that can serve to compare results from multiple-dye single-molecule Förster Resonance Energy Transfer (FRET)
with simulations and analytical results on models of biopolymers. While most measures characterizing the dynamics
of biomolecules start directly from numerical observations, in this work we build the theory of 4-point correlation
functions in the limiting cases of exactly solvable polymer models, which are subsequently validated by numerical
experiments. As the implementation of the formalism is not limited to the underlying model, applications to more
complex models which can only be treated numerically will be reported in future work. The results on the simple
models presented here already reveal important issues regarding the influence of interaction strength as well as the
geometry that are relevant to the interpretation of future experiments.

The article is organized as follows. In section II we motivate the choice of a 4-point correlation function
for characterizing the collective motion of biomolecules as a generalization of the present 2-point FRET-based
experiments. Focussing on analytically solvable models, we calculate both analytically and numerically the 4-point
correlation functions of two simple polymer models: the freely-jointed chain in section III and a discrete flexible
model with nearest-neighbor interaction in section IV. In both cases we study the convergence of covariances as
estimated in computational studies or single-molecule experiments from finite-time averages towards the ensemble
average determined by our exact calculations. We conclude with a discussion of our findings with respect to future
single-molecule experiments in section V.
II. DESIGN OF 4-POINT CORRELATION FUNCTIONS FOR BIOMOLECULES 

Present day single-molecule FRET techniques can retrieve the time-resolved distance between a donor and an acceptor
molecule. The donor/acceptor is a dye molecule attached to the biomolecule or a unit of the biomolecule itself,
whose dynamics reveal the motion of the underlying molecule. Recent highlights in experimental single-molecule
FRET setups include the high-resolution, long-time-scale observation of protein/ligand and protein dynamics [5, 6],
and the observation of fluctuations by three-color FRET in the DNA four-way junction [7]. The measurement of
FRET signals with more than two dye molecules in a single biomolecule has the advantage of yielding more than
one distance at a time. Still, such approaches are technically difficult to realize and require overcoming several
difficulties, e.g. the selective dye labelling within the probe molecule and a more demanding theoretical framework to
extract distance information from the multiple efficiency signals [8], to name but a few.

Despite these experimental and theoretical challenges, it is constructive to anticipate what information is to be gained
from such experiments, keeping in mind the rapid progress in this field over the past years. We address the question:
how do two distances revealed by FRET measurements correlate in biomolecules? In other words, given two distinct
pairs of points $(A, B)$ and $(C, D)$ in the single molecule, to what extent can correlation be expected when one measures
the time-dependent distances $R_1 = |\mathbf{R}_{AB}|$ and $R_2 = |\mathbf{R}_{CD}|$?

The covariance of distances constitutes a simple measure of this correlated motion. For the two time-dependent
observables, its definition reads

$$ C(R_1^2, R_2^2, T) = \overline{\left[R_1^2(t) - \overline{R_1^2}\right]\left[R_2^2(t) - \overline{R_2^2}\right]} = \overline{R_1^2(t)\,R_2^2(t)} - \overline{R_1^2}\;\overline{R_2^2} . \tag{1} $$

There, $\overline{A(t)} = \frac{1}{T}\int_{t_0}^{t_0+T} dt\, A(t)$ denotes the time average of the observable $A(t)$ over a time period $T$. The average is
assumed to start at the beginning of the measurement when $t = t_0$; consequently, the value of $C(R_1^2, R_2^2, T)$ in principle
also depends on the choice of $t_0$. This dependence only vanishes in the limit of long $T$ or when averaging over different $t_0$.
$C(R_1^2, R_2^2, T)$ is positive if the two signals simultaneously increase/decrease and negative if they vary in the opposite
sense. Finally, the covariance is zero if the fluctuations of these distances are uncorrelated. We further denote by
$C(R_1^2, R_2^2)$ the ensemble average value of the covariance to which $C(R_1^2, R_2^2, T)$ should converge in the limit $T \to \infty$.
We will moreover consider an idealization of the experimental setting: although the maximum distance for which a FRET
signal can be obtained is limited, we calculate the covariance for arbitrary distances.
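As a minimal sketch of how the time-averaged covariance (1) could be estimated from equally spaced samples of two signals, the following Python functions replace the time integral by a plain sample mean. The function names `time_averaged_covariance` and `running_covariance` are illustrative choices, not part of the original work; the inputs would be the recorded values of $R_1^2(t)$ and $R_2^2(t)$.

```python
def time_averaged_covariance(signal_1, signal_2):
    """Covariance of two equally sampled signals over the whole record.

    Discrete analogue of Eq. (1): mean of the product minus product of the means.
    """
    if len(signal_1) != len(signal_2) or len(signal_1) == 0:
        raise ValueError("signals must be non-empty and of equal length")
    n = len(signal_1)
    mean_1 = sum(signal_1) / n
    mean_2 = sum(signal_2) / n
    mean_product = sum(x * y for x, y in zip(signal_1, signal_2)) / n
    return mean_product - mean_1 * mean_2


def running_covariance(signal_1, signal_2):
    """Covariance estimates after 1, 2, ..., n samples (running time average)."""
    estimates = []
    sum_1 = sum_2 = sum_product = 0.0
    for count, (x, y) in enumerate(zip(signal_1, signal_2), start=1):
        sum_1 += x
        sum_2 += y
        sum_product += x * y
        estimates.append(sum_product / count - (sum_1 / count) * (sum_2 / count))
    return estimates
```

Two signals that rise together give a positive value, two that move in opposite directions a negative one; the running version shows how the estimate depends on the averaging period.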

In the following sections, we calculate the covariance $C(R_1^2, R_2^2)$ analytically and numerically for two simple polymer
models, the freely-jointed chain which is the simplest model, and a discrete flexible model which corresponds to
polymer chains with nearest-neighbor interactions.

III. FREELY-JOINTED CHAIN 

Let us now derive the distance covariance $C(R_1^2, R_2^2)$ from ensemble averages for the freely-jointed chain representing
a classical polymer model without interactions. We consider a chain of $N$ monomers of fixed length $b$ with no explicit
interactions between them. Each monomer vector $\mathbf{r}_i$ ($|\mathbf{r}_i| = b$) is labelled by a discrete index. We will denote the
distances to be considered for correlation by $\mathbf{R}_1 = \sum_{i=k_1}^{k_2} \mathbf{r}_i$, $\mathbf{R}_2 = \sum_{i=k_3}^{k_4} \mathbf{r}_i$, where the order $k_1 < k_2$, $k_3 < k_4$,
$k_i \in \{1, 2, \dots, N\}$ is assumed. This notation is illustrated in figure 1. Most of the classical results on this chain can
be found in [9]. The $N$-segment probability distribution function factorizes due to the lack of explicit interaction. It
reads

$$ P(\{\mathbf{r}_i\}) = \prod_{i=1}^{N} p(\mathbf{r}_i) = \prod_{i=1}^{N} \frac{\delta(|\mathbf{r}_i| - b)}{4\pi b^2} . \tag{2} $$



A. Analytical results 

The purpose of this calculation is to understand how non-vanishing 4-point correlations can arise between distances 
in polymers even if there is no specific interaction that could be responsible for cooperative effects. We study a 
correlator between the squared distances 



$$ C(R_1^2, R_2^2) = \langle R_1^2 R_2^2 \rangle - \langle R_1^2 \rangle \langle R_2^2 \rangle \tag{3} $$



FIG. 1: Scheme of the notation used for the freely-jointed chain.



As the probability distribution function factorizes, one has $C(R_1^2, R_2^2) = 0$ whenever the index ranges satisfy
$\{k_1, k_2\} \cap \{k_3, k_4\} = \emptyset$. Consequently, we only consider the case $\{k_1, k_2\} \cap \{k_3, k_4\} \neq \emptyset$. We split the summation in a way
that allows us to cancel the factorizable terms:



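$$ \langle R_1^2 R_2^2 \rangle = \left\langle \left( \sum_{i,j=k_1}^{k_3-1} \mathbf{r}_i\cdot\mathbf{r}_j + 2\sum_{i=k_1}^{k_3-1}\sum_{j=k_3}^{k_2} \mathbf{r}_i\cdot\mathbf{r}_j + \sum_{i,j=k_3}^{k_2} \mathbf{r}_i\cdot\mathbf{r}_j \right) \left( \sum_{i,j=k_3}^{k_2} \mathbf{r}_i\cdot\mathbf{r}_j + 2\sum_{i=k_3}^{k_2}\sum_{j=k_2+1}^{k_4} \mathbf{r}_i\cdot\mathbf{r}_j + \sum_{i,j=k_2+1}^{k_4} \mathbf{r}_i\cdot\mathbf{r}_j \right) \right\rangle \tag{4} $$

where the ordering $k_1 \le k_3 \le k_2 \le k_4$ is taken. There, the non-factorizable contribution arises from the overlapping sums.
Summing up the factorizable terms, we have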



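$$ C(R_1^2, R_2^2) = \left\langle \Big( \sum_{i,j=k_3,\; i \neq j}^{k_2} \mathbf{r}_i\cdot\mathbf{r}_j \Big)^2 \right\rangle = \frac{2}{3}\, b^4\, (k_2 - k_3 + 1)(k_2 - k_3) \tag{5} $$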



after an explicit integration in spherical coordinates. From this simple result, we draw the following conclusions. 

i) A non-zero covariance arises from a geometrical overlap of the distances $R_1$, $R_2$ along the sequence: if $R_1$ and $R_2$
share at least two monomers, the covariance of the squared distances scales as the square of the overlap $\zeta = k_2 - k_3 + 1$.
The result is independent of the size of the system and of the overall length of the distances $R_1$, $R_2$ themselves, as it should
be. The distances have to share at least two monomers, since for a single shared monomer the auto-correlation in $\langle R_1^2 R_2^2 \rangle$
is still contained in the offset $\langle R_1^2 \rangle \langle R_2^2 \rangle$. From a physical point of view, the result is plausible as no information from
neighboring segments can be transmitted through the chain owing to the absence of interacting forces. Only the part
of the topology which is shared among the distances $R_1$, $R_2$, i.e. the $\zeta$ overlapping segments, can contribute to a
non-vanishing correlation.

ii) The value of the covariance is non-negative; there is on average no anti-correlated motion in the chain.
In the interacting case this picture should change. In particular, the result should involve the whole lengths $R_1$, $R_2$,
at least for moderate interactions. Yet, strong interaction will tend to constrain the motion of the polymer into a
single configuration. In this case, $\langle R_1^2 R_2^2 \rangle = \langle R_1^2 \rangle \langle R_2^2 \rangle$, and $C(R_1^2, R_2^2) = 0$.
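The explicit integration behind (5) can be sketched as follows. This is a reconstruction from the factorized distribution (2), not a transcription of the authors' intermediate steps; $\theta_{ij}$ (the angle between $\mathbf{r}_i$ and $\mathbf{r}_j$) and $\zeta = k_2 - k_3 + 1$ (the number of shared segments) are the only symbols assumed here, and all sums run over the shared segments.

```latex
\begin{align}
\langle (\mathbf{r}_i\cdot\mathbf{r}_j)^2 \rangle
  &= b^4 \langle \cos^2\theta_{ij} \rangle
   = \frac{b^4}{4\pi} \int_0^{2\pi}\! d\varphi \int_0^{\pi}\! d\theta\, \sin\theta\, \cos^2\theta
   = \frac{b^4}{3}, \qquad i \neq j, \\
\Big\langle \Big( \sum_{i \neq j} \mathbf{r}_i\cdot\mathbf{r}_j \Big)^2 \Big\rangle
  &= 2 \sum_{i \neq j} \langle (\mathbf{r}_i\cdot\mathbf{r}_j)^2 \rangle
   = 2\, \zeta(\zeta - 1)\, \frac{b^4}{3}
   = \frac{2}{3}\, b^4\, \zeta(\zeta - 1).
\end{align}
```

In the second line, a term $(\mathbf{r}_i\cdot\mathbf{r}_j)(\mathbf{r}_k\cdot\mathbf{r}_l)$ survives the average only if the ordered pair $(k, l)$ equals $(i, j)$ or $(j, i)$, which gives the factor 2. For $\zeta = 1$ the sum is empty and the covariance vanishes, in line with conclusion i).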



B. Numerical results 



a. ensemble averages We verified (5) by simple sampling on an ensemble of $N_c$ randomly generated
freely-jointed chains according to the probability distribution (2) with a fixed number of total segments $N$ and fixed
monomer length $b$. The left-hand side of figure 2 shows a comparison of the covariance values $C(R_1^2, R_2^2)$ between the
ensemble averages obtained from $N_c = 5 \cdot 10^{?}$ chains of length $N = 50$ segments and the analytical result for a varying
number of overlap monomers $\zeta = 0, 1, \dots, 19$. The spacing between the points increases as the covariance scales
quadratically in the overlap.
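The simple-sampling check described above can be sketched in a few lines of Python. This is an illustrative reimplementation under stated assumptions, not the authors' code: the helper names (`random_bond`, `squared_distance`, `sampled_covariance`, `exact_covariance`) are hypothetical, the sample size is chosen for speed rather than to match figure 2, and the exact value is taken as $\frac{2}{3} b^4 \zeta(\zeta - 1)$ with $\zeta = k_2 - k_3 + 1$ shared segments.

```python
import math
import random


def random_bond(rng, b=1.0):
    """Bond vector of length b with uniformly distributed orientation."""
    z = rng.uniform(-1.0, 1.0)              # cos(theta) is uniform on the sphere
    phi = rng.uniform(0.0, 2.0 * math.pi)
    s = math.sqrt(1.0 - z * z)
    return (b * s * math.cos(phi), b * s * math.sin(phi), b * z)


def squared_distance(bonds, first, last):
    """|r_first + ... + r_last|^2 for 1-based, inclusive bond indices."""
    segment = bonds[first - 1:last]
    x = sum(bond[0] for bond in segment)
    y = sum(bond[1] for bond in segment)
    z = sum(bond[2] for bond in segment)
    return x * x + y * y + z * z


def sampled_covariance(k1, k2, k3, k4, n_chains, b=1.0, seed=0):
    """Ensemble estimate of <R1^2 R2^2> - <R1^2><R2^2> from independent chains."""
    rng = random.Random(seed)
    n_bonds = max(k2, k4)
    sum_1 = sum_2 = sum_12 = 0.0
    for _ in range(n_chains):
        bonds = [random_bond(rng, b) for _ in range(n_bonds)]
        d1 = squared_distance(bonds, k1, k2)
        d2 = squared_distance(bonds, k3, k4)
        sum_1 += d1
        sum_2 += d2
        sum_12 += d1 * d2
    return sum_12 / n_chains - (sum_1 / n_chains) * (sum_2 / n_chains)


def exact_covariance(k2, k3, b=1.0):
    """(2/3) b^4 zeta (zeta - 1), with zeta = k2 - k3 + 1 shared bonds."""
    zeta = max(k2 - k3 + 1, 0)
    return 2.0 / 3.0 * b ** 4 * zeta * (zeta - 1)
```

For example, `sampled_covariance(1, 3, 2, 4, n_chains=100000)` can be compared with `exact_covariance(3, 2)`; the statistical error shrinks with the square root of the number of chains.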

b. time averages Besides a static ensemble average, we can also estimate dynamical time averages. This can
be achieved by implementing a chain of mass points with rigid links for which we numerically solve the constrained
equations of motion. An adequate representation of the freely-jointed chain can be obtained by simulating a chain
of $N$ harmonic springs using constrained molecular dynamics. Here, we use the popular Brooks-Brunger-Karplus
algorithm for Langevin dynamics [10] combined with RATTLE [11] to solve the equations of motion of $N + 1$ particles
joined by harmonic springs, applying the constraints $|\mathbf{r}_i| = b$ at each time-step of the integration. Note that such an
approach has to be taken with care for several reasons.
FIG. 2: Left: Covariance calculated according to (5) and numerical results (simple sampling, $N_c = 5 \cdot 10^{?}$, $N = 50$, $\zeta = 0, 1, \dots, 19$).
The straight line $f(x) = x$ is a guide to the eye. Right: Covariance calculated according to (5) and numerical results (molecular
dynamics, see text). +: $m = 1.0$, x: $m = 100.0$, $\zeta = 0, 1, \dots, 19$.



i) Ideally, such a chain would be massless to avoid biasing the trajectory of an individual monomer by a collective
inertial motion. This case can only be approximated if the chain is strongly coupled to a heat bath so that inertial
effects are overdamped, and the time interval between the configurations recorded to calculate averages is sufficiently
long so that the monomer orientations can be effectively considered as uncoupled.

ii) It has recently been pointed out that applying constraints to a stochastic heat bath leads to temperature dependent
equilibrium bond lengths [12]. If the covariance were to be evaluated at different temperatures, a correction would need
to be applied to the integration algorithm, as the bond length enters with $b^4$.

All observables of the simulation are expressed in dimensionless units. We simulate $N + 1 = 51$ point masses of
equal mass $m = 1.0$ joined by harmonic springs (uniform spring constant $D = 1.0$) and an equilibrium bond length
between two masses $b = 1.0$. Using a time step of $\Delta t = 0.1$ and a friction constant $\gamma = 0.1$, we have $\gamma \Delta t \ll 1$
and $\Delta t \ll 2\pi / \sqrt{D/m}$. The initial velocities were taken from a Maxwell-Boltzmann distribution, and the chain is
thermalized at $k_B T = 0.05$ for $10^{?}$ time units in order to guarantee the loss of memory of the initial condition (a
partially stretched configuration). Configurations were recorded every 100 time units to compute averages. For a
constant temperature trajectory of length $10^{?}$ time units, the temporally averaged kinetic energy closely approached
the canonical expectation value. The right-hand side of figure 2 illustrates the effects of inertia on the comparison with
the analytical results. For small masses, the overdamped dynamics of the constrained harmonic chain approximate
the phase space sampling of the freely-jointed chain quite well, at least on the timescale of the simulation. However,
the results for a larger mass exhibit a strong deviation. The latter is observed to be more pronounced when the
overlap $\zeta$ is located at the center of the chain (data not shown) compared to the case of an overlap located at the
ends. Indeed, the free end is only affected by inertial motion from one side.



C. Relation of the distance covariance to the third order susceptibility in spin systems 



In classical spin systems, fourth-order correlation functions arise naturally when analyzing higher order
susceptibilities as response functions to an external field. To see the analogy with the calculation of the higher-order
correlations in polymers, we define the Hamiltonian of the system to be



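$$ H = H_0 + \mathbf{h}_1 \cdot \mathbf{O}_1 + \mathbf{h}_2 \cdot \mathbf{O}_2 \tag{6} $$

where $H_0$ is an unperturbed Hamiltonian of some spin chain and $\mathbf{O}_1$, $\mathbf{O}_2$ are two observables of the system coupling
to the external fields $\mathbf{h}_1$, $\mathbf{h}_2$. We can now introduce the third-order susceptibility $\chi^{(3,*)}_{iijj}$ associated with mixed derivatives
of the on-site free energy $f$ in the two fields $\mathbf{h}_1$, $\mathbf{h}_2$:

$$ \chi^{(3,*)}_{iijj} = -\left. \frac{\partial^4 f}{\partial h_{1,i}^2\, \partial h_{2,j}^2} \right|_{\mathbf{h}_1 = \mathbf{h}_2 = 0} , \qquad f = -\lim_{N \to \infty} \frac{k_B T}{N} \ln Z . \tag{7} $$

In this particular case we get, for Hamiltonians with vanishing first order moments in the observables $\mathbf{O}_1$ and $\mathbf{O}_2$,

$$ \chi^{(3,*)}_{iijj} = (k_B T)^{-3} \lim_{N \to \infty} \frac{1}{N} \left( \langle O_{1,i} O_{1,i} O_{2,j} O_{2,j} \rangle - \langle O_{1,i} O_{1,i} \rangle \langle O_{2,j} O_{2,j} \rangle - 2 \langle O_{1,i} O_{2,j} \rangle^2 \right) . \tag{8} $$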
$\chi^{(3,*)}_{iijj}$ can be related to $C(R_1^2, R_2^2)$. After a summation over the indices $i$, $j$, the first two terms within the bracket in
the previous expression correspond to $C(R_1^2, R_2^2)$ provided that the identification $\mathbf{O} \leftrightarrow \mathbf{R}$ is made. The extra term
$\langle R_{1,i} R_{2,j} \rangle^2$ has however no counterpart in the previous calculation. In contrast to the fourth-order correlation we
have defined above, calculating this susceptibility would require information not only on the distances
$|\mathbf{R}|$, but also on their orientation. Such information cannot be accessed by FRET techniques. Hence, in practice,
the measurement only allows one to infer the covariance, which is not an intensive quantity in the overlap.
We have seen that the analogy with spin systems relates the covariance measure to a generalized
susceptibility in a spin system. For the moment, our calculations deal with the simplest possible case of a polymer
chain without interaction. In the following section, we go beyond this simplest case, calculating the covariance in a
polymer model with nearest-neighbor interactions. This allows us to assess the qualitative and quantitative change of
the results upon introducing interactions. The analogy with spin systems will also prove very useful in this context,
as the starting point for our calculation is a set of exact results known for spin systems.

IV. DISCRETE FLEXIBLE CHAIN 

A classical system for studying polymers is the worm-like chain model (WLC). In this section, we study the behavior 
of a discrete flexible chain with nearest-neighbor interactions [13] which in the continuum limit and for large coupling 
constants yields the WLC model. A related model has been analyzed in the context of loop formation in double 
stranded DNA [14]. In contrast to the freely-jointed chain which represents a non-interacting system of stiff rods, this 
has the additional physical feature of penalizing local bending by a harmonic force deriving from the Hamiltonian 

$$ H = -\epsilon \sum_{i=1}^{N-1} \left( \mathbf{r}_i \cdot \mathbf{r}_{i+1} - b^2 \right) \tag{9} $$

where $|\mathbf{r}_i| = b$ and $\epsilon$ is a uniform coupling constant. A rescaling of the Hamiltonian, $H \to H - \epsilon (N - 1) b^2$, makes the
system equivalent to the zero-field limit of the classical one-dimensional Heisenberg chain with uniform coupling and
open boundaries. The latter has been extensively studied, see e.g. [15] for a physical perspective.

In 1994 Nakamura and Takahashi [16] reported the nonlinear susceptibility for a uniform field in this system. These
results can thus serve as a starting point for the calculation of the distance covariance in an interacting polymer
model. However, in order to be fully applicable, these should be generalized along the lines described in the following
section.



A. Analytical results 

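As the calculation of the covariance for the discrete flexible chain is more involved than for the freely-jointed chain,
we start with an outline of the procedure.
We are interested in the covariances

$$ C(R_1^2, R_2^2) = \langle R_1^2 R_2^2 \rangle - \langle R_1^2 \rangle \langle R_2^2 \rangle \tag{10} $$

with $\mathbf{R}_1 = \sum_{i=k_1}^{k_2} \mathbf{r}_i$, $\mathbf{R}_2 = \sum_{i=k_3}^{k_4} \mathbf{r}_i$ defined previously. Averages are taken with respect to the canonical probability
distribution function associated with the rescaled discrete flexible chain Hamiltonian

$$ H = -K k_B T \sum_{i=1}^{N-1} \hat{\mathbf{r}}_i \cdot \hat{\mathbf{r}}_{i+1} \tag{11} $$

with $K = \epsilon b^2 / (k_B T)$ and $\mathbf{r} = b\, \hat{\mathbf{r}}$. The squared distance correlation functions are deduced by using the isotropy of the
zero-field Hamiltonian and Fisher's result [17] for local correlations in the model (see also Appendix A). One obtains

$$ \langle R_1^2 \rangle = 3 \langle R_1^x R_1^x \rangle = b^2 \left[ a\, \frac{1 + u}{1 - u} - 2\, \frac{u (1 - u^a)}{(1 - u)^2} \right] \tag{12} $$

where $u = \coth(K) - K^{-1}$ and $a = k_2 + 1 - k_1$. However, the correlator of the two squared distances

$$ \langle R_1^2 R_2^2 \rangle = \sum_{i,j=k_1}^{k_2} \sum_{k,l=k_3}^{k_4} \langle (\mathbf{r}_i \cdot \mathbf{r}_j)(\mathbf{r}_k \cdot \mathbf{r}_l) \rangle \tag{13} $$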
cannot be directly inferred from previous results and deserves special attention. The two sums run over different
domains, such that the use of permutation symmetry is subject to restrictions. The isotropy of the interactions can
be used to reduce the number of correlators to be evaluated to two, i.e. $\langle r_i^x r_j^x r_k^x r_l^x \rangle$ and $\langle r_i^x r_j^x r_k^y r_l^y \rangle$. All the other
correlators are deduced from a rotation of the reference frame (the latter being a symmetry of the Hamiltonian).
$\langle r_i^x r_j^x r_k^x r_l^x \rangle$ has been derived in [16] (see also [18] cited therein), whereas $\langle r_i^x r_j^x r_k^y r_l^y \rangle$ is not known as it usually does
not arise in problems dealing with magnetism where the applied field is uniform. Combining these observations, we
get

$$ \langle R_1^2 R_2^2 \rangle = \sum_{i,j=k_1}^{k_2} \sum_{k,l=k_3}^{k_4} \left( 6 \langle r_i^x r_j^x r_k^y r_l^y \rangle + 3 \langle r_i^x r_j^x r_k^x r_l^x \rangle \right) . \tag{14} $$
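The mean squared distance of a stretch of $a$ consecutive segments can be cross-checked numerically. The sketch below assumes Fisher's two-point result in the form $\langle \hat{\mathbf{r}}_i \cdot \hat{\mathbf{r}}_j \rangle = u^{|i-j|}$ with $u = \coth(K) - K^{-1}$, and compares the direct double sum with the geometric-series closed form $b^2 [a(1+u)/(1-u) - 2u(1-u^a)/(1-u)^2]$. The function names are illustrative, and the small-$K$ series used for $u$ is an implementation detail added here for numerical stability.

```python
import math


def u_ratio(K):
    """u = coth(K) - 1/K, using the series K/3 - K^3/45 close to K = 0."""
    if abs(K) < 1e-3:
        return K / 3.0 - K ** 3 / 45.0
    return 1.0 / math.tanh(K) - 1.0 / K


def mean_squared_distance_direct(a, K, b=1.0):
    """b^2 * sum_{i,j} u^{|i-j|} over a stretch of a consecutive bonds."""
    u = u_ratio(K)
    return b * b * sum(u ** abs(i - j) for i in range(a) for j in range(a))


def mean_squared_distance_closed(a, K, b=1.0):
    """Closed form of the same double sum (geometric series)."""
    u = u_ratio(K)
    return b * b * (a * (1.0 + u) / (1.0 - u)
                    - 2.0 * u * (1.0 - u ** a) / (1.0 - u) ** 2)
```

At $K = 0$ both expressions reduce to the freely-jointed value $a\, b^2$, and for large $K$ they approach the rod limit $a^2 b^2$.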

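In Appendix A, we derive the correlators $\langle r_i^x r_j^x r_k^y r_l^y \rangle$ and $\langle r_i^x r_j^y r_k^x r_l^y \rangle$ for $i < j < k < l$. The result reads

$$ \langle r_i^x r_j^x r_k^y r_l^y \rangle = \frac{b^4}{9}\, u^{j-i} \left( 1 - \frac{2}{5} v^{k-j} \right) u^{l-k} , \tag{15} $$

$$ \langle r_i^x r_j^y r_k^x r_l^y \rangle = \langle r_i^x r_j^y r_k^y r_l^x \rangle = \frac{b^4}{15}\, u^{j-i+l-k}\, v^{k-j} . \tag{16} $$

We also recall here the result of [16] on $\langle r_i^x r_j^x r_k^x r_l^x \rangle$:

$$ \langle r_i^x r_j^x r_k^x r_l^x \rangle = \frac{b^4}{9}\, u^{j-i} \left( \frac{4}{5} v^{k-j} + 1 \right) u^{l-k} . \tag{17} $$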
In the above formulae, we have introduced 

u 1 
u = 1 - 3— , w; = - {K^^ coth(i4:) - A'"2) 

t = {3 + K"^ -3Kcoth(K)) . (18) 

With these exact results, we can decompose the summations involved in (10) by symbolic computation [19]. The
results for varying coupling constant $K$ and chain overlap $\zeta$ are presented in the following section along with the
results obtained from direct numerical computation.
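Instead of a symbolic decomposition, the sums can also be evaluated by brute force for small index ranges. The Python sketch below is not the authors' procedure: it assumes that the four-point averages of unit bond vectors follow from the transfer eigenvalue ratios $u = \coth(K) - K^{-1}$ and $v = 1 - 3u/K$, so that after summing over Cartesian components $\langle (\hat{\mathbf{r}}_a \cdot \hat{\mathbf{r}}_b)(\hat{\mathbf{r}}_c \cdot \hat{\mathbf{r}}_d) \rangle = u^{(b-a)+(d-c)}$ for side-by-side pairs and $u^{(b-a)+(d-c)} (1 + 2 v^{c-b})/3$ for interleaved pairs (indices sorted as $a \le b \le c \le d$). All function names are hypothetical.

```python
import math


def u_of(K):
    """u = coth(K) - 1/K (series expansion near K = 0)."""
    if abs(K) < 1e-2:
        return K / 3.0 - K ** 3 / 45.0 + 2.0 * K ** 5 / 945.0
    return 1.0 / math.tanh(K) - 1.0 / K


def v_of(K):
    """v = 1 - 3u/K (series expansion near K = 0)."""
    if abs(K) < 1e-2:
        return K ** 2 / 15.0 - 2.0 * K ** 4 / 315.0
    return 1.0 - 3.0 * u_of(K) / K


def pair_product_average(i, j, k, l, K):
    """<(r_i . r_j)(r_k . r_l)> for unit bond vectors (assumed form, see lead-in)."""
    u, v = u_of(K), v_of(K)
    # Tag each index with the scalar product it belongs to, then sort along the chain.
    (a, tag_a), (b, tag_b), (c, _), (d, _) = sorted([(i, 0), (j, 0), (k, 1), (l, 1)])
    outer = u ** ((b - a) + (d - c))
    if tag_a == tag_b:                      # pairs sit side by side: (a b)(c d)
        return outer
    return outer * (1.0 + 2.0 * v ** (c - b)) / 3.0   # pairs are interleaved


def covariance(k1, k2, k3, k4, K, b=1.0):
    """C(R1^2, R2^2) for R1 = r_k1 + ... + r_k2 and R2 = r_k3 + ... + r_k4."""
    range_1 = range(k1, k2 + 1)
    range_2 = range(k3, k4 + 1)
    u = u_of(K)
    mean_1 = sum(u ** abs(i - j) for i in range_1 for j in range_1)
    mean_2 = sum(u ** abs(k - l) for k in range_2 for l in range_2)
    mixed = sum(pair_product_average(i, j, k, l, K)
                for i in range_1 for j in range_1
                for k in range_2 for l in range_2)
    return b ** 4 * (mixed - mean_1 * mean_2)
```

At $K = 0$ this reduces to the freely-jointed result, e.g. `covariance(1, 3, 2, 4, 0.0)` gives $4/3$, and scanning $K$ for the index choice $(1, 3, 2, 4)$ locates the maximum of the covariance, which the text reports at $K_{max} \approx 1.126$. The cost grows with the fourth power of the segment count, so this is only meant for short stretches.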



B. Numerical results 



c. ensemble averages We numerically evaluate the covariance using a Monte Carlo sampling of the classical
one-dimensional Heisenberg model. A new trial configuration was obtained from the arbitrary reorientation of a
randomly chosen spin. For small interaction strengths, such a sampling scheme achieves a high acceptance ratio.
However, at stronger couplings, this ratio quickly drops. We evaluate the covariance $C(R_1^2, R_2^2)$ with $\mathbf{R}_1 = \sum_{i=k_1}^{k_2} \mathbf{r}_i$
and $\mathbf{R}_2 = \sum_{i=k_3}^{k_4} \mathbf{r}_i$, which we will denote from now on with the shorthand $C_{k_1 k_2 k_3 k_4}(K)$ to indicate its dependence
on the coupling strength $K$. On the left hand side of figure 3, we show a comparison between the analytical and
numerical results for the Heisenberg model or, equivalently, the discrete flexible chain as a function of the coupling
strength $K$ for a small chain with $N = 5$ segments. We used $10^{?}$ Monte Carlo steps for each value of the coupling
constant, and chose $(k_1, k_2, k_3, k_4) = (1, 3, 2, 4)$, i.e. $\zeta = 2$. Notice that for $K = 0$, we recover the limit of the
non-interacting freely-jointed chain, $C_{1324}(0)/b^4 = 4/3$. For small coupling $K$, the correction to the zero-coupling
limit is linear, but the increase is sharper for larger $K$ and passes through a maximum. Then, for large values of
$K$, the covariance decreases with $K$ towards zero. These results can be interpreted in view of the stiffening of the
chain with increasing $K$: whereas a larger coupling constant initially tends to enhance cooperative motion among
the segments, it also reduces the volume of the configurational space sampled on average as the energy penalty
for overlapping monomers increases. This situation is schematically represented on the left hand side of figure 3,
where snapshots of typical configurations of a dynamical implementation of the discrete flexible chain (discussed in
the next subsection) for values of $K$ below and above the maximum are shown. This can be understood more
quantitatively by studying the joint probability distribution function $P(R_1^2, R_2^2)$ of the two distances. Figure 4 shows
$P(R_1^2, R_2^2)$ in the previous case $(k_1, k_2, k_3, k_4) = (1, 3, 2, 4)$ for different values of the coupling constant $K$: at zero
interaction in the freely-jointed limit (left-hand panel), at $K = K_{max} \approx 1.126$ which maximizes $C_{1324}(K)$ in figure 3
(middle panel), and at $K = 8$ (right-hand panel). One observes that the transition $C_{1324}(K) \to 0$ for large $K$ goes
FIG. 3: Numerical comparison of the analytical result for the freely-jointed chain (solid line) with MC sampling of the discrete
flexible chain as a function of interaction strength $K$; Left: covariance $C_{k_1 k_2 k_3 k_4}(K)$ for $(k_1, k_2, k_3, k_4) = (1, 3, 2, 4)$; points:
results from MC sampling of a chain of length $N = 5$ with $10^{?}$ steps; line: analytical results using (10), (12), (13), (15-17); Right:
Covariance $C_{1 k_2 k_3 (k_2+1)}(K)$ for fixed $K$ with varying overlap $\zeta = k_2 - k_3 + 1$ from MC sampling (points); the solid line shows the
first order linear approximation $C_{1 k_2 k_3 (k_2+1)}(K)/b^4 \approx \frac{2}{3} \zeta (\zeta - 1) + \frac{8}{9} K (\zeta - 1)^2$.




FIG. 4: Joint probability distribution function $P(R_1^2, R_2^2)$ for different coupling constants in the case $(k_1, k_2, k_3, k_4) = (1, 3, 2, 4)$
obtained from MC sampling as in figure 3. From left to right, the coupling constants are $K = 0$, $K = K_{max} \approx 1.126$, $K = 8$;
gray and black colors indicate regions of higher probability (each panel normalized by $\max_{R_1, R_2} P(R_1^2, R_2^2)$).



along with a restriction to a relatively small part of configurational space associated with a stretched configuration of
the polymer.
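The single-spin Monte Carlo scheme described in this subsection can be sketched as follows. This is a minimal Metropolis illustration under stated assumptions, not the authors' implementation: the Boltzmann weight is taken as $\exp(K \sum_i \hat{\mathbf{r}}_i \cdot \hat{\mathbf{r}}_{i+1})$ for unit bond vectors with open ends, one measurement is taken per sweep, and the names `metropolis_covariance`, `n_sweeps` and `n_burn_in` are hypothetical.

```python
import math
import random


def random_unit_vector(rng):
    """Uniformly distributed direction on the unit sphere."""
    z = rng.uniform(-1.0, 1.0)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    s = math.sqrt(1.0 - z * z)
    return (s * math.cos(phi), s * math.sin(phi), z)


def squared_distance(spins, first, last):
    """|r_first + ... + r_last|^2 for 1-based, inclusive indices (b = 1)."""
    segment = spins[first - 1:last]
    return sum(sum(spin[c] for spin in segment) ** 2 for c in range(3))


def metropolis_covariance(K, n_bonds, k1, k2, k3, k4, n_sweeps,
                          n_burn_in=1000, seed=0):
    """Covariance of R1^2 and R2^2 from single-spin Metropolis updates."""
    rng = random.Random(seed)
    spins = [random_unit_vector(rng) for _ in range(n_bonds)]
    sum_1 = sum_2 = sum_12 = 0.0
    for sweep in range(n_burn_in + n_sweeps):
        for _ in range(n_bonds):
            site = rng.randrange(n_bonds)
            trial = random_unit_vector(rng)      # arbitrary reorientation
            # Local field: sum of the nearest neighbours (open boundaries).
            field = [0.0, 0.0, 0.0]
            for neighbour in (site - 1, site + 1):
                if 0 <= neighbour < n_bonds:
                    for c in range(3):
                        field[c] += spins[neighbour][c]
            # gain = -(H_new - H_old) / (k_B T) for H = -K k_B T sum_i r_i . r_{i+1}
            gain = K * sum((trial[c] - spins[site][c]) * field[c] for c in range(3))
            if gain >= 0.0 or rng.random() < math.exp(gain):
                spins[site] = trial
        if sweep >= n_burn_in:
            d1 = squared_distance(spins, k1, k2)
            d2 = squared_distance(spins, k3, k4)
            sum_1 += d1
            sum_2 += d2
            sum_12 += d1 * d2
    return sum_12 / n_sweeps - (sum_1 / n_sweeps) * (sum_2 / n_sweeps)
```

A call such as `metropolis_covariance(1.0, 5, 1, 3, 2, 4, n_sweeps=40000)` gives an estimate of $C_{1324}(K)/b^4$ at $K = 1$. Since each trial direction is drawn uniformly, the acceptance ratio drops as $K$ grows, which is the limitation mentioned in the text; a proposal restricted to small rotations would be a natural alternative at strong coupling.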

For small values of the coupling constant, the first order linear correction term can be estimated by
performing simulations for varying overlap $\zeta = k_2 - k_3 + 1$. On the right hand side of figure 3, we evaluate the covariance
function $C_{1 k_2 k_3 (k_2+1)}(K)$ for $K = 0.01$ fixed, and with $\zeta = 2, \dots, 15$ along with a corresponding increase of the chain size
$N$. The results are fitted by the first order linear approximation function $C_{1 k_2 k_3 (k_2+1)}(K)/b^4 \approx \frac{2}{3} \zeta (\zeta - 1) + \frac{8}{9} K (\zeta - 1)^2$.
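The linear coefficient can be motivated by a short first-order expansion. The following is a sketch under stated assumptions rather than the authors' derivation: unit bond vectors $\hat{\mathbf{r}}_i$, a Boltzmann weight $\exp(K V)$ with $V = \sum_i \hat{\mathbf{r}}_i \cdot \hat{\mathbf{r}}_{i+1}$, exactly one non-shared segment adjacent to the overlap on each side, and the shorthand $S_a = \sum_{i<j} \hat{\mathbf{r}}_i \cdot \hat{\mathbf{r}}_j$ over the segments of $\mathbf{R}_a$ (so that $R_a^2 = b^2 (n_a + 2 S_a)$). Averages $\langle \cdot \rangle_0$ are taken at $K = 0$.

```latex
\begin{align}
\langle X \rangle_K &\approx \langle X \rangle_0 + K \langle X V \rangle_0 ,
  \qquad \langle V \rangle_0 = 0 , \\
C(R_1^2, R_2^2) &\approx \tfrac{2}{3}\, b^4\, \zeta(\zeta - 1)
  + 4\, b^4 K\, \langle S_1 S_2 V \rangle_0 , \\
\langle (\hat{\mathbf{a}}\cdot\hat{\mathbf{b}})(\hat{\mathbf{b}}\cdot\hat{\mathbf{c}})(\hat{\mathbf{c}}\cdot\hat{\mathbf{a}}) \rangle_0
  &= \tfrac{1}{9} \quad \text{(closed triangles are the only surviving terms)} , \\
\langle S_1 S_2 V \rangle_0
  &= \tfrac{1}{9} \left[ 2(\zeta - 1)(\zeta - 2) + 2(\zeta - 1) \right]
   = \tfrac{2}{9} (\zeta - 1)^2 , \\
C(R_1^2, R_2^2)/b^4 &\approx \tfrac{2}{3}\, \zeta(\zeta - 1) + \tfrac{8}{9}\, K (\zeta - 1)^2 .
\end{align}
```

The count in the fourth line collects triangles whose nearest-neighbor edge lies inside the overlap (first term) or straddles one of its two ends (second term). For $\zeta = 2$ this gives $4/3 + 8K/9$, the small-$K$ behavior of $C_{1324}(K)/b^4$.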

d. time averages Similarly to the non-interacting case, we would like to estimate the covariance from time
averages using a dynamical implementation of the model. As outlined in section III B, we implement a
distance-constrained chain of mass points, while adding a contribution to the potential energy function,
$V_{KP} = -K k_B T \sum_{i=1}^{N-1} (\hat{\mathbf{r}}_i \cdot \hat{\mathbf{r}}_{i+1} - 1)$, that accounts for the energy penalty for monomer overlap in the discrete flexible chain. In
the case $K = 0$, we recover the previous non-interacting case while for non-vanishing coupling values, the discrete
flexible chain generates an orientation-dependent force. Here, we used a chain of $N = 7$ segments to estimate the
running-time average

$$ C_{k_1 k_2 k_3 k_4}(K, T) = \overline{R_1^2 R_2^2} - \overline{R_1^2}\; \overline{R_2^2} \tag{19} $$

with $\overline{A(t)} = \frac{1}{T} \int_{t_0}^{t_0+T} dt'\, A(t')$ as defined previously. As an example, we chose $(k_1, k_2, k_3, k_4) = (1, 4, 2, 5)$. The
simulation parameters were the same as described in section III B except for a smaller time-step $dt = 0.02$ and an
equilibration time of $5 \cdot 10^{?}$ time units. Configurations were recorded for averaging every 20 time-steps. Figure 5
shows the evolution of the running-time average on a logarithmic scale for different values of the coupling constant
($K = 0.8, 1.6, 2.4, 3.6, 4.4, 5.2, 6, 8, 10$). Notice that these couplings include values below and above the coupling







FIG. 5: Left: Time convergence of the running time average of the covariance $C_{k_1 k_2 k_3 k_4}(K, T)$ for different coupling constants $K$.
Here, $(k_1, k_2, k_3, k_4) = (1, 4, 2, 5)$ and $N = 7$; the blue lines and points show the running time estimation of the covariance
averaged over 1000 independent initial conditions, the red lines indicate the exact result. Right: Time convergence of the
standard deviation of the distribution of covariances from different initial conditions. In both figures, the unit of time is $20\,dt$
(see text).







FIG. 6: Probability distribution function of the time averaged covariance function obtained from 10000 independent initial
conditions, $(k_1, k_2, k_3, k_4) = (1, 5, 4, 6)$, $N = 7$ and $K = 0.4$. The red line indicates the analytically predicted value. The unit
of time is $20\,dt$ (see text).



maximizing the covariance. For each $K$, the results were averaged over 1000 initial conditions to avoid statistical
dependence of the results on a particular initial condition for the trajectory. While all simulations converge towards
values close to the analytical results indicated by the red lines on the left hand side of figure 5, one observes that within
the time of simulation this value is not exactly reached, and the relative changes towards long times are small. This
can be seen more quantitatively by taking a closer look at the standard deviation of the covariance at different points
in time. On the right hand side of figure 5, we computed the standard deviation $\sigma$ of the covariance estimated from
averages over different initial conditions. For long times, the standard deviation decreases approximately as $T^{-1/2}$, indicating
a slow change in time. For these timescales, increasing time is equivalent to increasing the number of statistically
independent samples, such that the overall convergence behavior is Gaussian. This convergence behaviour is not
limited to this particular observable, but can also be obtained for other variables such as $R_1^2$. An illustration of the
convergence of this distribution is given in figure 6, where the probability distribution function of the covariance
is shown as a function of time based on an estimation from 10000 initial conditions. The scale of binning of the
histograms at each time step is based on the broad distribution of values at the smallest timescale. One observes
that the onset of the scaling with $T^{-1/2}$ in figure 5 is reflected here by the evolution toward distributions with a well
defined peak and decreasing width at half maximum for long times.
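The statement that a longer averaging time acts like a larger number of independent samples can be illustrated with a toy calculation. The sketch below is an assumption-laden stand-in for the simulations: independent freely-jointed configurations replace decorrelated time frames, the chain has four unit segments with $\mathbf{R}_1$ spanning segments 1 to 3 and $\mathbf{R}_2$ segments 2 to 4, and the function names are hypothetical. Under these assumptions the spread of the covariance estimate should shrink roughly by a factor of two when the number of samples is quadrupled.

```python
import math
import random


def random_unit_vector(rng):
    """Uniformly distributed direction on the unit sphere."""
    z = rng.uniform(-1.0, 1.0)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    s = math.sqrt(1.0 - z * z)
    return (s * math.cos(phi), s * math.sin(phi), z)


def squared_norm_of_sum(vectors):
    """|sum of the given vectors|^2."""
    return sum(sum(vec[c] for vec in vectors) ** 2 for c in range(3))


def covariance_estimate(n_samples, rng):
    """Covariance of R1^2 and R2^2 from n_samples independent 4-bond chains."""
    sum_1 = sum_2 = sum_12 = 0.0
    for _ in range(n_samples):
        bonds = [random_unit_vector(rng) for _ in range(4)]
        d1 = squared_norm_of_sum(bonds[0:3])   # segments 1..3
        d2 = squared_norm_of_sum(bonds[1:4])   # segments 2..4
        sum_1 += d1
        sum_2 += d2
        sum_12 += d1 * d2
    return sum_12 / n_samples - (sum_1 / n_samples) * (sum_2 / n_samples)


def spread_of_estimates(n_samples, n_repeats, seed=0):
    """Standard deviation of the covariance estimate over independent repeats."""
    rng = random.Random(seed)
    estimates = [covariance_estimate(n_samples, rng) for _ in range(n_repeats)]
    mean = sum(estimates) / n_repeats
    variance = sum((e - mean) ** 2 for e in estimates) / (n_repeats - 1)
    return math.sqrt(variance)
```

Comparing `spread_of_estimates(100, 300)` with `spread_of_estimates(400, 300)` gives a ratio close to two, the square root of the ratio of sample counts. In a real trajectory the effective number of independent samples is reduced by the correlation time, so this scaling only sets in once $T$ exceeds that time.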



V. SUMMARY AND OUTLOOK 



We introduced the covariance of distances as a simple measure to characterize correlated and uncorrelated
motion in polymers and biomolecules. In this work, we focussed on the exactly solvable cases only. We considered
two highly simplified models of homogeneous polymers. These can be thought of as a zeroth-order approximation
to the real situation. A result which may seem surprising at first sight is that even a system without interacting
forces can display a non-vanishing covariance stemming merely from a geometric overlap in the distances. Moving to
a more realistic system, the discrete flexible chain, we have shown that the results of the non-interacting chain can be
recovered as a special case of the covariance for the system with interaction. Also, we have shown that for small values
of the coupling constant, i.e. for small excluded-volume penalties or, equivalently, high temperatures, the correction
to the results of the freely-jointed chain scales linearly with the coupling constant. This effect is due to an increase
of correlated motion. This trend is, however, not monotonic. With increasing coupling constant, the covariance
reaches a maximum and then decreases to zero. As a consequence, there exists a finite value of $K$ for which
the ensemble average of the covariance coincides with the value obtained at zero interaction. A simple interpretation
of this observation is that though stronger coupling increases the cooperativity, it also reduces the configurational
space sampled as the penalty of the excluded volume increases. Consequently, the average amplitude of distance
fluctuation decreases with $K$. In section IV, we also analyzed the convergence of the time average of the covariance
measure towards the ensemble average obtained from an exact calculation. While the quantitative difference between
the predicted value and the results from numerical experiments certainly depends on the coupling to the stochastic heat
bath, the qualitative results indicate that an algebraic decay in the variance would in principle occur for any type
of canonical sampling, provided the time scale is long enough to decorrelate events along individual trajectories.
Regarding the situation in single molecule experiments, though our results have been in a highly idealized setting, 
some notable conclusions can be drawn. In single molecule FRET experiments, the acquisition time of the intensity 
signal is always limited either by instrumental factors or the destruction of the fiuorescent marker by photobleaching. 
Accordingly, one can only expect to find meaningful results if the data acquisition time is much longer than the slowest 
timescale of the underlying dynamics of the molecule, and if the experiment can be sufficiently repeated so as to allow 
one for averages over independent trajectories. In the present model where the interaction is dominated by a single 
parameter, it appears that the approach to convergence is not affected by the strength of interaction. Though the 
expectation value of the covariance varies with K, the approach towards this value based on running time averages is 
K-independent. The convergence behavior in this model is therefore not sufficient to draw conclusions on the physical state of 
the system (strongly or weakly interacting), whereas the mean value of the covariance is related to it (figure 3 in section IV). It 
therefore appears compulsory for an experiment aiming to distinguish among different states to be calibrated on an 
absolute intensity scale. From the results of the idealized models considered in this work, a promising future direction 
appears to be to apply the covariance analysis to more complex models of biopolymers where analytical approaches are 
not feasible anymore. Even if the ensemble average cannot be evaluated exactly, numerical experiments characterizing 
the dynamical convergence, similar to the ones performed on the polymer models in this article, are still feasible on 
long timescales. As an example, for proteins, a natural question arises whether the convergence of the covariance 
of distances within secondary structure elements differs significantly from the relaxation timescales on the level of 
tertiary structure. In particular, it would be promising to look at these cases under different physical conditions 
related to the folding process. The results of such a study might provide insight on how to choose, from a very large 
number of possibilities, particularly interesting locations for fluorescent labelling in single-molecule spectroscopy, and 
hence help in getting a closer view of "molecules at work" beyond autocorrelation functions. 
APPENDIX A: DERIVATION OF 4-TH ORDER ELEMENTS FOR THE DISCRETE FLEXIBLE CHAIN 

In what follows, the derivation of the 4-th order elements required for the evaluation of the covariances in section 
IV is outlined in a more general way than required for the purpose of the present work by considering a chain with 
non-homogeneous couplings. 

1. The set of angles for integration 

We choose the following coordinate system for integration (see illustration 7). All unit vectors are marked 
by their polar angles with respect to a fixed frame, $(\Theta_j, \Phi_j)$. These are the so-called laboratory angles. To simplify 
the notation, we adopt the definition $|\mathbf{r}_j| = 1$, omitting the hat. We also choose a reference vector $\mathbf{r}_i$ labelled by its 
laboratory angles. All other vectors are marked by their polar coordinates $(\theta_j, \varphi_j)$ with respect to the axial vector 
$\mathbf{r}_{j-1}$ if $j > i$ and $\mathbf{r}_{j+1}$ if $j < i$. Then for $j > i$, the vector $\mathbf{r}_{j+1}$ is deduced from $\mathbf{r}_j$ by a rotation of axis $\hat{\mathbf{z}} \wedge \mathbf{r}_j$ 
and angle $\theta_{j+1}$, followed by a rotation of axis $\mathbf{r}_j$ and angle $\varphi_{j+1}$. The situation is analogous for $j < i$. A rotation of 
angle $\theta$ around a unit axis $\mathbf{a}$ acts on a vector $\mathbf{x}$ according to 

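$$R_{\mathbf{a},\theta} \cdot \mathbf{x} = \cos(\theta)\,\mathbf{x} + (1 - \cos(\theta))\,(\mathbf{a} \cdot \mathbf{x})\,\mathbf{a} + \sin(\theta)\,(\mathbf{a} \wedge \mathbf{x}). \tag{A1}$$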
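
As a quick sanity check of the rotation rule (A1), the following Python sketch implements it for plain 3-tuples. The helper names `dot`, `cross` and `rotate` are ours and purely illustrative; the axis is assumed to be normalized by the caller.

```python
import math

def dot(a, b):
    """Scalar product of two 3-vectors given as tuples."""
    return sum(p * q for p, q in zip(a, b))

def cross(a, b):
    """Vector (wedge) product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def rotate(x, axis, angle):
    """Rotate x by `angle` about the unit vector `axis`, following (A1)."""
    c, s = math.cos(angle), math.sin(angle)
    along = dot(axis, x)        # (a . x)
    normal = cross(axis, x)     # (a ^ x)
    return tuple(c * x[i] + (1.0 - c) * along * axis[i] + s * normal[i]
                 for i in range(3))

# A quarter turn about the z axis should map the x axis onto the y axis.
rotated = rotate((1.0, 0.0, 0.0), (0.0, 0.0, 1.0), math.pi / 2)
```

Since the formula describes a proper rotation, the norm of any vector is preserved, which is what allows the unit-length constraint on the chain segments to survive the recursion.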

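Combining the two aforementioned rotations gives 

$$\mathbf{r}_{j+1} = \left(\cos(\theta_{j+1}) + \sin(\theta_{j+1})\cot(\Theta_j)\cos(\varphi_{j+1})\right)\mathbf{r}_j - \sin(\theta_{j+1})\begin{pmatrix} \sin(\Phi_j)\sin(\varphi_{j+1}) \\ -\cos(\Phi_j)\sin(\varphi_{j+1}) \\ \cos(\varphi_{j+1})/\sin(\Theta_j) \end{pmatrix}. \tag{A2}$$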

(A2) constitutes a recursion relation for the laboratory angles in terms of those located lower in the chain 
and of the local angles $(\theta_j, \varphi_j)$. The longitudinal recurrence is simple, 

$$\cos(\Theta_{j+1}) = \cos(\theta_{j+1})\cos(\Theta_j) - \sin(\theta_{j+1})\cos(\varphi_{j+1})\sin(\Theta_j), \tag{A3}$$

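whereas the transverse one is a little more involved: 

$$\sin(\Theta_{j+1})\begin{pmatrix}\cos(\Phi_{j+1}) \\ \sin(\Phi_{j+1})\end{pmatrix} = \cos(\theta_{j+1})\sin(\Theta_j)\begin{pmatrix}\cos(\Phi_j) \\ \sin(\Phi_j)\end{pmatrix} + \sin(\theta_{j+1})\begin{pmatrix}\cos(\varphi_{j+1})\cos(\Phi_j)\cos(\Theta_j) - \sin(\varphi_{j+1})\sin(\Phi_j) \\ \cos(\varphi_{j+1})\sin(\Phi_j)\cos(\Theta_j) + \sin(\varphi_{j+1})\cos(\Phi_j)\end{pmatrix}. \tag{A4}$$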
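
The recursion relations can be checked numerically by composing the two rotations explicitly. The sketch below assumes the construction described above (a tilt by the local polar angle about the axis z ∧ r_j, followed by a turn by the local azimuth about r_j) and compares the result with the longitudinal relation (A3) and with a transverse relation written out as we read it from (A4). All function names are illustrative.

```python
import math
import random

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def rotate(x, axis, angle):
    """Rotation of x about a unit axis, as in (A1)."""
    c, s = math.cos(angle), math.sin(angle)
    along, normal = dot(axis, x), cross(axis, x)
    return tuple(c * x[i] + (1.0 - c) * along * axis[i] + s * normal[i]
                 for i in range(3))

def unit_vector(polar, azimuth):
    return (math.sin(polar) * math.cos(azimuth),
            math.sin(polar) * math.sin(azimuth),
            math.cos(polar))

def next_vector(Theta, Phi, theta, phi):
    """Build r_{j+1} from r_j by the two successive rotations."""
    r = unit_vector(Theta, Phi)
    n = cross((0.0, 0.0, 1.0), r)
    norm = math.sqrt(dot(n, n))
    n = tuple(c / norm for c in n)      # normalized tilt axis z ^ r_j
    tilted = rotate(r, n, theta)        # polar tilt by theta
    return rotate(tilted, r, phi)       # azimuthal turn by phi about r_j

def recursion_prediction(Theta, Phi, theta, phi):
    """Components of r_{j+1} predicted by the recursions (assumed form)."""
    z = math.cos(theta) * math.cos(Theta) - math.sin(theta) * math.cos(phi) * math.sin(Theta)
    x = (math.cos(theta) * math.sin(Theta) * math.cos(Phi)
         + math.sin(theta) * (math.cos(phi) * math.cos(Phi) * math.cos(Theta)
                              - math.sin(phi) * math.sin(Phi)))
    y = (math.cos(theta) * math.sin(Theta) * math.sin(Phi)
         + math.sin(theta) * (math.cos(phi) * math.sin(Phi) * math.cos(Theta)
                              + math.sin(phi) * math.cos(Phi)))
    return (x, y, z)

rng = random.Random(0)
max_error = 0.0
for _ in range(1000):
    Theta = rng.uniform(0.1, math.pi - 0.1)   # stay away from the poles
    Phi = rng.uniform(0.0, 2.0 * math.pi)
    theta = rng.uniform(0.0, math.pi)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    built = next_vector(Theta, Phi, theta, phi)
    predicted = recursion_prediction(Theta, Phi, theta, phi)
    max_error = max(max_error, max(abs(p - q) for p, q in zip(built, predicted)))
```

The polar angle is kept away from 0 and π because the tilt axis z ∧ r_j is ill-defined there; this mirrors the cot and 1/sin factors in (A2).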

This information is enough to compute the two- and four-point correlation functions. The latter will be computed 
for an inhomogeneous Hamiltonian 

$$H = -k_B T \sum_{j=1}^{N-1} b_{j+1}\,\mathbf{r}_j \cdot \mathbf{r}_{j+1}, \qquad |\mathbf{r}_j| = 1. \tag{A5}$$

The results of interest will follow either by taking the homogeneous limit, which we do in section IV where $b_i = K$, or 
by assuming that the couplings $\{b_i\}$ are statistically independent variables and taking the ensemble average over such 
a distribution. We start by re-deriving the known results for the two-point functions, as the techniques involved there 
will be used for the higher-order correlators. 

2. The two-point function 

Let $d\omega = \sin(\theta)\,d\theta\,d\varphi$ and $d\Omega = \sin(\Theta)\,d\Theta\,d\Phi$. One has 

$$\int d\omega\, e^{b\cos(\theta)} = 4\pi\,\mathrm{shc}(b) = 4\pi\,\frac{\sinh(b)}{b}. \tag{A6}$$
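
The normalization integral (A6) is easy to confirm numerically. The sketch below integrates over the polar angle with a simple midpoint rule (the azimuthal integral contributes a factor of 2π); the function names and the test value of the coupling are arbitrary choices for illustration.

```python
import math

def shc(b):
    """sinh(b)/b, with the limiting value 1 at b = 0."""
    return math.sinh(b) / b if b != 0.0 else 1.0

def sphere_integral(b, n=20000):
    """Midpoint-rule estimate of the integral of exp(b cos(theta)) over the unit sphere."""
    h = math.pi / n
    total = 0.0
    for k in range(n):
        theta = (k + 0.5) * h
        total += math.sin(theta) * math.exp(b * math.cos(theta))
    return 2.0 * math.pi * h * total

b_test = 1.7  # arbitrary illustrative coupling
numeric = sphere_integral(b_test)
closed_form = 4.0 * math.pi * shc(b_test)
```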
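Then, for $i < j$, one has for the only non-trivial correlator 

$$\langle r_i^z r_j^z \rangle = \int \frac{d\omega_1}{4\pi\,\mathrm{shc}(b_2)} \cdots \frac{d\omega_{i-1}}{4\pi\,\mathrm{shc}(b_i)}\,\frac{d\Omega_i}{4\pi}\,\frac{d\omega_{i+1}}{4\pi\,\mathrm{shc}(b_{i+1})} \cdots \frac{d\omega_N}{4\pi\,\mathrm{shc}(b_N)}\,\exp\left(\sum_{k=1}^{i-1} b_{k+1}\cos(\theta_k) + \sum_{k=i+1}^{N} b_k\cos(\theta_k)\right)\cos(\Theta_i)\cos(\Theta_j)$$

$$= \int \frac{d\Omega_i}{4\pi}\,\frac{d\omega_{i+1}}{4\pi\,\mathrm{shc}(b_{i+1})} \cdots \frac{d\omega_j}{4\pi\,\mathrm{shc}(b_j)}\,\exp\left(\sum_{k=i+1}^{j} b_k\cos(\theta_k)\right)\cos(\Theta_i)\cos(\Theta_j). \tag{A7}$$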

After this first trivial integration, in order to deal with the coupled integrals, we use the recursion of the angles. As 
$\Theta_{j-1}$ is independent of $(\theta_j, \varphi_j)$, we can integrate over the latter variables. We set 

$$u(b) = \int \frac{d\omega}{4\pi\,\mathrm{shc}(b)}\,\cos(\theta)\,e^{b\cos(\theta)} = \coth(b) - \frac{1}{b}, \tag{A8}$$

and since $\int d\omega\,\sin(\theta)\cos(\varphi)\,e^{b\cos(\theta)} = 0$, we get 

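$$\langle r_i^z r_j^z \rangle = u(b_j)\,\langle r_i^z r_{j-1}^z \rangle = \prod_{k=i+1}^{j} u(b_k) \int \frac{d\Omega}{4\pi}\cos^2(\Theta) = \frac{1}{3}\prod_{k=i+1}^{j} u(b_k), \tag{A9}$$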

recovering the result of Fisher [17]. A similar calculation can be done for the four-point functions. Here and in the 
following, we assume that $i < j < k < \ell$. Due to a trivial integration over the angles located before the $i$-th site, one has 

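$$\langle r_i^{\alpha} r_j^{\beta} r_k^{\gamma} r_\ell^{\delta} \rangle_{b} = \langle r_1^{\alpha} r_{\tilde{j}}^{\beta} r_{\tilde{k}}^{\gamma} r_{\tilde{\ell}}^{\delta} \rangle_{\tilde{b}}, \tag{A10}$$

where $\tilde{j} = j - i + 1$ (and likewise for $k$ and $\ell$), $\tilde{b}_k = b_{i+k-1}$ and $\tilde{\theta}_k = \theta_{i+k-1}$. Hence, we can compute the correlators starting from the first site 
and then, if needed, perform the aforementioned shifts. 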
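
The product structure of the two-point function can be illustrated with a small Monte Carlo sketch. It assumes that u(b) is the mean of cos(θ) under the weight exp(b cos θ), i.e. the Langevin function coth(b) − 1/b, and that the chain is sampled site by site with independent local angles. The coupling values, sample size and function names are hypothetical choices made only for this illustration.

```python
import math
import random

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def normalize(v):
    norm = math.sqrt(sum(c * c for c in v))
    return tuple(c / norm for c in v)

def langevin(b):
    """Mean of cos(theta) under the weight exp(b cos(theta)); assumed to be u(b)."""
    return 1.0 / math.tanh(b) - 1.0 / b

def sample_cos(b, rng):
    """Draw x = cos(theta) with density proportional to exp(b x) on [-1, 1], b > 0."""
    low, high = math.exp(-b), math.exp(b)
    return math.log(low + rng.random() * (high - low)) / b

def random_unit_vector(rng):
    z = rng.uniform(-1.0, 1.0)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    s = math.sqrt(1.0 - z * z)
    return (s * math.cos(phi), s * math.sin(phi), z)

def next_site(r, b, rng):
    """Sample the next unit vector given the previous one and the coupling b."""
    helper = (1.0, 0.0, 0.0) if abs(r[0]) < 0.9 else (0.0, 1.0, 0.0)
    e1 = normalize(cross(r, helper))   # any unit vector orthogonal to r
    e2 = cross(r, e1)                  # completes the orthonormal frame
    c = sample_cos(b, rng)
    s = math.sqrt(max(0.0, 1.0 - c * c))
    phi = rng.uniform(0.0, 2.0 * math.pi)
    return tuple(c * r[i] + s * (math.cos(phi) * e1[i] + math.sin(phi) * e2[i])
                 for i in range(3))

def estimate_zz(couplings, samples, seed=1):
    """Monte Carlo estimate of the z-z correlation between first and last site."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(samples):
        first = random_unit_vector(rng)
        r = first
        for b in couplings:
            r = next_site(r, b, rng)
        total += first[2] * r[2]
    return total / samples

couplings = [3.0, 1.5]                 # hypothetical inhomogeneous couplings
estimate = estimate_zz(couplings, 40000)
exact = langevin(couplings[0]) * langevin(couplings[1]) / 3.0
```

With a few times 10^4 samples the statistical error is of order 10^-3, so the estimate should agree with the product formula to about two decimal places.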

We start with the XXZZ case. There, we choose $\mathbf{r}_j$ to be the reference vector for all of the "moving frame" angles. 
Thus 



The recursion relations for the angles read 



$$r_p^x = \cos(\theta_p)\,r_{p+1}^x + f_x(\cos(\varphi_p), \sin(\varphi_p)), \qquad p \in [1; j-1], \tag{A12}$$
$$r_p^z = \cos(\theta_p)\,r_{p-1}^z + f_z(\cos(\varphi_p), \sin(\varphi_p)), \qquad p \in [k+1; \ell]. \tag{A13}$$

The functions $f_{x,z}$ are linear in $\cos(\varphi_p)$ and $\sin(\varphi_p)$. Hence, their integrals over the azimuthal angle $\varphi_p$ give zero. 
We are thus led to the same recursion as for the two-point function. Eventually, 

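$$\langle r_1^x r_j^x r_k^z r_\ell^z \rangle = \prod_{p=2}^{j} u(b_p) \prod_{p=k+1}^{\ell} u(b_p) \int \frac{d\Omega_j}{4\pi}\,(r_j^x)^2\,(r_k^z)^2 \prod_{p=j+1}^{k} \frac{d\omega_p\,e^{b_p\cos(\theta_p)}}{4\pi\,\mathrm{shc}(b_p)}. \tag{A14}$$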

The recurrence equation for the squared laboratory angles reads: 



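$$(r_k^z)^2 = \left(\cos^2\theta_k - \sin^2\theta_k\cos^2\varphi_k\right)(r_{k-1}^z)^2 + \sin^2\theta_k\cos^2\varphi_k - \frac{1}{2}\sin 2\theta_k\,\cos\varphi_k\,\sin 2\Theta_{k-1}. \tag{A15}$$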
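
The recurrence for the squared z-component follows from squaring the longitudinal relation (A3). The sketch below checks it pointwise for random angles, written in the form we read from (A15), and then averages over the azimuthal angle to show which terms survive. The function names are illustrative, and the averaged coefficients are computed here rather than taken from the text.

```python
import math
import random

def squared_z_direct(theta, phi, Theta_prev):
    """Square of cos(Theta_k) obtained from the longitudinal recursion (A3)."""
    cos_Theta = (math.cos(theta) * math.cos(Theta_prev)
                 - math.sin(theta) * math.cos(phi) * math.sin(Theta_prev))
    return cos_Theta ** 2

def squared_z_recurrence(theta, phi, Theta_prev):
    """Right-hand side of the squared recurrence (assumed form of A15)."""
    coeff = math.cos(theta) ** 2 - math.sin(theta) ** 2 * math.cos(phi) ** 2
    return (coeff * math.cos(Theta_prev) ** 2
            + math.sin(theta) ** 2 * math.cos(phi) ** 2
            - 0.5 * math.sin(2.0 * theta) * math.cos(phi) * math.sin(2.0 * Theta_prev))

def azimuthal_average(f, theta, Theta_prev, n=360):
    """Average of f over a uniform grid of azimuthal angles."""
    return sum(f(theta, 2.0 * math.pi * m / n, Theta_prev) for m in range(n)) / n

rng = random.Random(0)
worst_pointwise = 0.0
worst_averaged = 0.0
for _ in range(200):
    theta = rng.uniform(0.0, math.pi)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    Theta_prev = rng.uniform(0.0, math.pi)
    worst_pointwise = max(worst_pointwise,
                          abs(squared_z_direct(theta, phi, Theta_prev)
                              - squared_z_recurrence(theta, phi, Theta_prev)))
    # After azimuthal averaging, cos(phi) -> 0 and cos^2(phi) -> 1/2.
    expected = (0.5 * (3.0 * math.cos(theta) ** 2 - 1.0) * math.cos(Theta_prev) ** 2
                + 0.5 * math.sin(theta) ** 2)
    averaged = azimuthal_average(squared_z_direct, theta, Theta_prev)
    worst_averaged = max(worst_averaged, abs(averaged - expected))
```

The averaged form makes the structure of the subsequent linear recursion visible: a multiplicative coefficient built from (3 cos²θ − 1)/2 and an additive term built from sin²θ/2.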

The linear term in $\cos\varphi_k$ can be dropped after an integration over the azimuthal angle, so that after agreeing upon 



we get 



/HO T > p^p cos(6ip) 

17 n 1:1(6) ■ 
p— j-t-l VP/ 

This is a linear recursion whose homogeneous solution is $I_k = I_j \prod_{p=j+1}^{k} v(b_p)$. Setting $I_k = \tilde{I}_k \prod_{p=j+1}^{k} v(b_p)$, we get 
Thus, the overall solution reads 

$$I_k = \frac{1}{15}\prod_{p=j+1}^{k} v(b_p) + \sum_{p=j+1}^{k} w(b_p) \prod_{i=p+1}^{k} v(b_i),$$

where we have written the explicit value of $I_j$. This expression can be slightly simplified if one takes averages over the 
variables $b$. Assuming that these are independent random variables, we get from averaging $[\cdot]$ over disorder 

$$I_k = \frac{1}{15}[v(b)]^{k-j} + [w(b)]\sum_{p=j+1}^{k}[v(b)]^{k-p} = \frac{1}{15}[v(b)]^{k-j} + [w(b)]\,\frac{1 - [v(b)]^{k-j}}{1 - [v(b)]}. \tag{A21}$$
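
The structure of this solution, a first-order linear recursion solved by a product term plus a sum of partial products, can be illustrated generically. The sketch below assumes a recursion of the form I_k = v_k I_{k-1} + w_k; the starting value and the coefficients are hypothetical placeholders, not the paper's v(b) and w(b).

```python
def iterate(start, v_list, w_list):
    """Apply I_k = v_k * I_{k-1} + w_k step by step."""
    value = start
    for v, w in zip(v_list, w_list):
        value = v * value + w
    return value

def explicit_solution(start, v_list, w_list):
    """Product term plus sum over partial products (inhomogeneous couplings)."""
    n = len(v_list)
    full_product = 1.0
    for v in v_list:
        full_product *= v
    total = start * full_product
    for p in range(n):
        tail = 1.0
        for v in v_list[p + 1:]:       # product over the sites after p
            tail *= v
        total += w_list[p] * tail
    return total

def homogeneous_solution(start, v, w, n):
    """Closed form for constant coefficients, using the geometric sum (v != 1)."""
    return start * v ** n + w * (1.0 - v ** n) / (1.0 - v)

# Hypothetical numbers, chosen only to exercise the formulas.
start = 0.2
v_list = [0.5, 0.3, 0.8, 0.6]
w_list = [0.05, 0.07, 0.02, 0.04]
step_by_step = iterate(start, v_list, w_list)
in_closed_form = explicit_solution(start, v_list, w_list)

uniform_iterated = iterate(start, [0.4] * 5, [0.1] * 5)
uniform_closed = homogeneous_solution(start, 0.4, 0.1, 5)
```

When the coefficients at different sites are statistically independent, the average of each product factorizes, which is why the disorder-averaged result takes the same form as the homogeneous closed form.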

Note that the results for a homogeneous Hamiltonian follow from the distribution $\delta(b_i - K)$. After performing the 
appropriate shifts we get 

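$$\langle r_i^x r_j^x r_k^z r_\ell^z \rangle = [u(b)]^{j-i+\ell-k}\left(\frac{1}{15}[v(b)]^{k-j} + [w(b)]\,\frac{1 - [v(b)]^{k-j}}{1 - [v(b)]}\right). \tag{A22}$$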

We now pass to the XZXZ and XZZX cases. The first reduction is identical with the previous case. Namely 

— p=k+l •' P=3 + l ^ ^' 



p=2 



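These equalities are due to the fact that, for $\alpha = x, z$, one has a relation of the type 

$$r_\ell^{\alpha} = \cos(\theta_\ell)\,r_{\ell-1}^{\alpha} + f^{\alpha}(\cos(\varphi_\ell), \sin(\varphi_\ell)) \quad \text{with} \quad \int_0^{2\pi} d\varphi_\ell\, f^{\alpha}(\cos(\varphi_\ell), \sin(\varphi_\ell)) = 0. \tag{A24}$$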



Thus moving $r_\ell^z$ to the position $k$ produces the same coefficient as moving $r_\ell^x$ to the position $k$. When writing a 
recursion for $r_k^x r_k^z$, it is enough to keep the terms that produce a non-zero value after an integration over the azimuthal 
variables $\varphi_k$. Thus, up to terms that vanish upon integration, 

$$r_k^x r_k^z \simeq \frac{1}{2}\,r_{k-1}^x r_{k-1}^z \left(3\cos^2\theta_k - 1\right). \tag{A25}$$
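
The statement that only a term proportional to (3 cos²θ − 1)/2 survives the azimuthal integration can be checked numerically. The sketch below builds the next unit vector in the local spherical frame of the previous one and averages the product of its x and z components over a uniform azimuthal grid. The frame construction and all names are our illustrative assumptions.

```python
import math
import random

def local_frame(Theta, Phi):
    """Unit vector with laboratory angles (Theta, Phi) and its two tangent vectors."""
    r = (math.sin(Theta) * math.cos(Phi), math.sin(Theta) * math.sin(Phi), math.cos(Theta))
    e_polar = (math.cos(Theta) * math.cos(Phi), math.cos(Theta) * math.sin(Phi), -math.sin(Theta))
    e_azimuth = (-math.sin(Phi), math.cos(Phi), 0.0)
    return r, e_polar, e_azimuth

def averaged_xz(Theta, Phi, theta, n=360):
    """Azimuthal average of the x*z product of the next vector, and x*z of the previous one."""
    r, e_polar, e_azimuth = local_frame(Theta, Phi)
    total = 0.0
    for m in range(n):
        phi = 2.0 * math.pi * m / n
        nxt = [math.cos(theta) * r[i]
               + math.sin(theta) * (math.cos(phi) * e_polar[i] + math.sin(phi) * e_azimuth[i])
               for i in range(3)]
        total += nxt[0] * nxt[2]
    return total / n, r[0] * r[2]

rng = random.Random(0)
worst = 0.0
for _ in range(200):
    Theta = rng.uniform(0.0, math.pi)
    Phi = rng.uniform(0.0, 2.0 * math.pi)
    theta = rng.uniform(0.0, math.pi)
    averaged, previous = averaged_xz(Theta, Phi, theta)
    expected = 0.5 * (3.0 * math.cos(theta) ** 2 - 1.0) * previous
    worst = max(worst, abs(averaged - expected))
```

The cancellation relies on the previous vector and its two tangent vectors forming an orthonormal basis, so that the sum of their x*z products vanishes.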



Setting 



we obtain 

$$\langle r_1^x r_j^z r_k^x r_\ell^z \rangle = \frac{1}{15}\prod_{p=2}^{j} u(b_p) \prod_{p=k+1}^{\ell} u(b_p) \prod_{p=j+1}^{k} t(b_p).$$

Thus performing the ensemble average as well as the shifts, we get 



J ^ k 

)=iTri"(M n n (A27) 



^ 3 



$$\langle r_i^x r_j^z r_k^x r_\ell^z \rangle = \frac{1}{15}[u(b)]^{j-i+\ell-k}\,[t(b)]^{k-j}. \tag{A28}$$